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Perturbation  Bounds  for 


Let  A and  B be  Hermitian  matrices  and  let  c(A,B)  - inf{|x  (A*iB)x| 
: ||x||*lh  The  eigenvalue  problem  Ax  - XBx  is  called  definite  if 
c(A,B)  > 0.  It  is  shown  that  a definite  problem  has  a conplete 
system  of  eigenvectors  and  its  eigenvalues  are  real.  Under  pertur- 
bations of  A and  B,  the  eigenvalues  behave  like  the  eigenvalues  of 
a Hermitian  matrix  in  the  sense  that  there  is  a 1-1  pairing  of  the 
eigenvalues  with  the  perturbed  eigenvalues  and  a uniform  bound  for 
their  differences  (in  this  case  in  the  chordal  metric) . Perturbation 
bounds  are  also  developed  for  eigenvectors  and  eigenspaces. 
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Perturbation  Bounds  for 


the  Definite  Generalized  Eigenvalue  Problem 
G.  W.  Stewart 


1.  Introduction 

In  this  paper  we  shall  be  concerned  with  deriving  perturbation  bowds 
for  the  eigenvalues  and  eigenvectors  of  the  generalized  eigenvalue  problem 

(1.1)  Ax  ■ XBx  , 

where  A and  B are  Hermitian  matrices  of  order  n.  When  B is  positive 
definite,  as  it  is  in  most  applications,  the  problem  can  be  reduced  to  a 
Hermitian  eigenvalue  problem  of  the  form 

(1.2)  B'1/2AB'1/2y  - Xy  , 

where  B1^2  is  the  positive  definite  square  root  of  B.  Thus,  in  this 
case,  the  eigenvalues  are  real. 

When  A and  B are  replaced  by  A = A ♦ E and  B * B ♦ F,  the  eigen- 
values and  eigenvectors  will  be  perturbed  by  quantities  that  are  functions 
of  E and  F.  In  principle  one  can  apply  the  existing  theory  for  Hermi- 
tian  eigenvalue  problems  to  (1.2)  and  obtain  bounds  on  these  perturbations; 
however,  this  approach  is  unsatisfactory  for  a number  of  reasons,  which  we 
shall  now  sketch  (for  a more  conplete  discussion  and  examples  see  [3,10,12]). 

When  B is  ill  conditioned,  that  is  when  B is  relatively  near  a 
singular  matrix,  the  eigenvalues  of  (1.1)  will  typically  be  spread  out,  with 
some  small  and  some  large.  The  small  ones  may  be  relatively  insensitive 
to  perturbations  in  A and  B.  However,  since  the  problem  (1.2)  has  large 


eigenvalues,  B1/4AB  ' l must  also  be  large,  and  so  will  its  corresponding 
perturbation.  The  perturbation  theory  for  (1.2)  will  then  predict  large 
perturbations  for  all  the  eigenvalues,  even  the  small  ones.  A second 
difficulty  is  that  the  perturbation  in  B must  be  restricted  so  that  B 
remains  positive  definite,  even  though  perturbations  that  make  B indefinite 
may  have  little  effect  on  some  of  the  eigenvalues.  Finally,  although  the 
large  eigenvalues  of  the  problem  usually  undergo  large  perturbations,  their 
reciprocals  will  undergo  only  small  perturbations.  This  suggests  that  the 
usual  Euclidean  metric  on  the  line  is  not  appropriate  for  reporting  the 
sizes  of  the  perturbations  in  the  eigenvalues. 

In  [12]  the  author  has  developed  a perturbation  theory  for  the  non- 
Hermitian  generalized  eigenvalue  problem  that  circumvents  these  difficulties, 
first  by  avoiding  the  use  of  inverses  and  second  by  using  the  chordal 
metric  on  the  Riemann  sphere  (cf.  Section  3).  In  [3]  Crawford  has  described 
a class  of  Hermitian  problems,  called  definite  problems  in  this  paper,  that 
admit  of  a nice  perturbation  theory  in  the  chordal  metric.  It  is  the  pur- 
pose of  this  paper  to  strengthen  and  extend  Crawford's  results.  In  particu- 
lar we  shall  obtain  the  one-one  pairing  of  eigenvalues  with  their  perturbed 
counterparts  that  holds  for  the  Hermitian  eigenvalue  problem.  In  addition 
we  shall  obtain  perturbation  bounds  for  eigenspaces  that  are  related  to 
the  perturbation  bounds  of  Davis  and  Kahan  [4]  for  invariant  subspaces  of 
Hermitian  operators. 

Throughout  this  paper  A and  B will  be  Hermitian  matrices  of  order 
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A * A ♦ E,  B * B ♦ F 


will  be  Hermitian  perturbations  of  A and  B.  The  Euclidean  vector  norm 
and  the  spectral  matrix  norm  will  both  be  denoted  by  || - 1| . 


2.  The  geometric  theory 

Recently  there  has  emerged  an  elegant  geometric  theory  for  the  real 

symmetric  generalized  eigenvalue  problem  that  is  based  on  the  quadratic 
T 

form  x (A+iB)x.  For  topological  reasons,  this  theory  does  not  apply  when 
n = 2.  In  this  section  we  shall  extend  this  theory  to  the  Hermitian  problem 
in  such  a way  that  the  restriction  n ^ 2 is  removed.  We  shall  return  to 
the  real  case  at  the  end  of  the  section. 

Our  ultimate  goal  is  to  replace  the  generalized  eigenvalue  problem 
(1)  by  an  equivalent  problem  in  which  B is  positive  definite.  Specifi- 
cally for  any  real  <p  let 

A = A cos  <p  - B sin  <p  , 

<P 

B = A sin  (p  + B cos  <p  . 

<P 

Then  it  is  obvious  that  any  matrix  X for  which  X^AX  and  X^BX  are 
diagonal  also  diagonalizes  A^  and  B^.  Thus  (1.1)  has  a complete  set 
of  eigenvectors  if  and  only  if  the  problem  A^x  = XB^x  has  the  same  set. 

We  shall  attempt  to  choose  <p  so  that  B^  is  positive  definite.  The 
condition  under  which  this  can  be  done  is  that  the  pair  (A,B)  be  definite 
in  the  following  sense. 


(2.1) 


4 


Definition  2.1. 


The  pair  of  Hermitian  matrices  (A,B)  is  definite  if 


c(A,B) 


e inf  { |xH(A*iB)x| ) > 0 . 
I|x||-1 


The  eigenvalue  problem  (1.1)  is  definite  if  (A,B)  is  definite. 

The  proof  that  c(A,B)  > 0 implies  that  B^  is  positive  definite 
for  some  p involves  the  geometry  of  the  set  over  which  the  infimum 
in  (2.1)  is  taken.  Let 


V - (x^(A+iB)x  : ||x||-l) 


and 

C = {xH(A+iB)x  : x € C11} 
s (av  : v € V,  a > 0 } . 

Then  c(A,B)  = inf  (|v|  : v ( V).  Moreover,  C is  the  pointed  cone  genera- 
ted by  the  closed,  bounded  set  C.  We  may  now  state  our  fundamental  theorem. 

Theorem  2.2.  The  cone  C is  convex.  If  (A,B)  is  definite, 
then  C lies  properly  in  a closed  half  plane  passing  through  the  origin. 
Moreover  there  is  a real  number  <p  such  that  is  positive  definite  and 

c(A,B)  - xnln(iy  , 

where  X • (B  ) denotes  the  smallest  eigenvalue  of  B . 
min'  ip'  <P 

Proof.  The  set  V is  just  the  field  of  values  of  A ♦ iB,  which 


- 


is  known  to  be  convex  (e.g.  see  [7]).  Hence  the  cone  C generated  by  V 
is  convex.  Now  suppose  that  c(A,B)  > 0.  Let  the  infimum  in  (2.1)  be 


Since  V is  convex 
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attained  at  xQ  and  let  vQ  - Xg(A*iB)xg 
v0  ( V satisfies  |vQ|  ■ inf  {|v|  : v € V)  » c(A,B)  > 0,  V must  lie  in 
the  closed  half  plane  H - {z  : R(vQz)  2 | V0I • which  does  not  contain  the 
origin.  Since  V is  bounded,  it  must  be  contained  in  a cone  that  is 
properly  contained  in  the  half  plane  H - vQ,  which  passes  through  the 
origin. 

To  prove  the  last  part  of  the  theorem,  let  V^,  C^,  and  denote 
the  field  of  values,  the  cone,  and  the  half  plane  associated  with  the 
pair  (A^,B^).  Since 

A ♦ iB  * e^(A+iB)  , 

<P  <p 

it  follows  that  V , C , and  H are  just  V,  C,  and  H rotated  clock- 
<p  <p  <P 

wise  through  the  angle  <p.  Choose  ip  so  that  lies  in  the  upper 
half  plane.  Hien  x^A  xQ  - 0.  Moreover  for  ||x||  - 1,  x^x  > c(A^,B^) 

- c(A,B).  Hence 

0 < c(A,B)  = x!|b  xn  * inf  xHBx  . 

° ° llxll-1 

This  shows  that  B is  positive  definite,  Xg  is  an  eigenvector  of 
corresponding  to  ^(B^),  and  c(A,B)  'X 

Since  in  the  above  is  positive  definite,  the  Hermitian  matrix 
B’1//2A  B-1/2  exists  and  can  be  diagonalized  by  a unitary  matrix  Y.  It 

ip  <p  cp 

is  easily  verified  that  X - B^1^  diagonalizes  both  A^  and  B^,  and 
hence  A and  B.  This  proves 


Corollary  2.3.  If  the  problan  (1.1)  is  definite,  then  there  is  a 
nonsingular  matrix  X such  that  X**AX  and  X**BX  are  both  diagonal. 

If  we  set  X*lAX  - diag(iu  • • • »nn)  ®nd  *BX  * diagfvj.v^, . . . ,vn)  , 

then  the  eigenvalues  of  (1.1)  are  given  by  X.  - u^/v^.  This  admits  the 
possibility  of  infinite  eigenvalues  when  ■ 0;  however,  the  indeter- 
minant case  (i.  * vi  * 0 cannot  occur  in  a definite  problem. 

The  number  c(A,B)  will  play  an  important  role  in  the  perturbation 
theory  of  the  next  two  sections.  The  following  theorem  shows  that  it 
does  not  vary  wildly  with  perturbations  in  A and  B.  This  result  is 
required  in  Section  3,  where  it  is  the  number  c(A,B)  that  appears  in 
the  bounds. 

Theorem  2.4. 

c(A,B)  > C(A,B)  - [||E||2*||F||2]1/2  . 

Proof. 

c(A,B)  = inf  { (xH(A-*-E)x]2  ♦ [xH(B*F)x]2)1/2 
llxll-1 

> inf  [(xHAx)2+(xH‘Bx)2]1/2  - sup  [(xHEx)2+(x,,Fx)2] 

llxll-1  llxll-1 

> c(A,B)  - [ sup  (x**Ex)2-*-  sup  (x^Fx)2]^2 

llxll-1  ||x||-l 

= c(A,B)  - [||E||2f||F||2]  1/2.o 

We  turn  now  to  the  case  where  A and  B are  real.  It  is  natural  in 
this  case  to  restrict  the  vector  x in  (2.1)  to  lie  in  R11  and  define 
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Vr  - {x1  (A*iB)x  : llxll-1,  x ( Rn) 
and 

cr(A,B)  - inf  { | v|  : v ( Vr)  . 

Brickman  [1]  has  shown  that  if  n i 2,  then  Vr  * V,  and  consequently 
cf(A,B)  - c(A,B).  It  follows  that  rheorem  2.2  is  valid  for  n i 2,  and 
in  particular  we  can  restrict  our  attention  to  real  vectors.  That  this 
happy  state  of  affairs  does  not  hold  for  n - 2 is  shown  by  the  example 


for  which  c(A,B)  * 0 < 1 ■ cr(A,B).  However,  we  have  the  following 
corollary  of  Theorem  2.2. 

Corollary  2.5.  For  n = 2,  if  c(A,B)  > 0 then  cr(A,B)  - c(A,B). 

Proof.  By  Theorem  2.2  we  may  find  a <p  such  that  B^  is  positive 
definite  and  an  eigenvector  x of  B^  such  that  ||x||-l  and  x**B^x  • c(A,B). 

U 

Since  x A^x  =0,  A^  is  indefinite.  Now  if  the  eigenvalue  c(A,B)  is  of 
multiplicity  one,  then  x must  be  a scalar  multiple  of  a real  vector  y, 
which  then  satisfies  y A^y  = 0 and  y B^y  * c(A,B).  If  c(A,B)  is  of  multi- 
plicity two,  then  any  nonzero  vector  is  an  eigenvector.  Since  A is  real 

and  indefinite,  there  is  a vector  y of  norm  unity  such  that  y A^y  - 0 
T 

and  y B^y  = c(A,B).o 

For  the  real  case,  much  of  the  material  in  this  section  is  implied  by 
earlier  work  [1,2,3,5,6,14].  Hestenes  [6]  has  characterized  C as  in 


Theorem  2.2,  and  Crawford  [3)  appears  to  be  the  first  to  introduce  the 
rotated  problem.  I am  indebted  to  Hans  Schneider  for  pointing  out  that 
the  restriction  n i 2 can  be  removed  in  the  con^lex  case.  The  charac- 
terization of  c(A,B)  as  the  smallest  eigenvalue  of  a positive  definite 
B is  new  and  replaces  an  incorrect  statement  in  Crawford's  paper. 

The  number  c(A,B)  appears  repeatedly  in  the  bounds  to  be  derived 
in  the  next  sections,  which  is  why  we  have  taken  some  pains  to  ascertain 
when  the  computationally  simpler  nunber  cr(A,B)  is  equal  to  c(A,B). 
Crawford  was  the  first  to  realize  the  importance  of  the  value  of  c(A,B), 
as  opposed  to  the  relation  c(A,B)  > 0,  and  for  this  reason  it  is  appro- 
priate to  refer  to  c(A,B)  as  the  Crawford  number  of  the  problem  (1.1). 

3.  Perturbation  bounds : eigenvalues 

In  this  section  we  shall  develop  perturbation  bounds  for  the  eigen- 
values of  the  definite  generalized  eigenvalue  problem.  For  the  Hermitian 
eigenvalue  problem  Ax  = Xx,  it  is  well  known  t'.iat  if  the  eigenvalues  are 
ordered  so  that  - ^2  ” * 5 Xn  and  t*lose  the  Perturbed  problem 

Ax  = Xx  are  ordered  so  that  X.  5 X,  S ...  < X then 

1 i.  n 

|Xi-Xi|  < HA-AII  , (i-1,2 n)  . 

We  shall  obtain  a similar  pairing  of  eigenvalues  with  their  perturbed 
counterparts;  however  the  ordering  that  defines  the  pairing  must  be 
defined  in  terms  of  certain  angles  associated  with  the  eigenvalues  of  (1.1). 
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Let  the  pair  (A,B)  be  definite  and  let  II H|| 2 ♦ II F|| “ < c(A,B), 
so  that  by  Theorem  2.4  the  pair  (A, 6)  is  also  definite.  Let  C and 
C denote  the  cones  associated  with  the  two  pairs.  Then  by  Theorem  2.2 
the  complement  of  C U C contains  a ray  R extending  from  the  origin. 

For  each  nonzero  point  u + iv  € C define  8(u,v)  as  the  angle  subtended  by 
R and  (a(u+iv)  : a > 0}  measured  clockwise.  By  construction,  6 is 
continuous  on  C U C. 

Now  let  Ax^  ■ X-Bxit  where  x-  + 0.  We  define  the  angle  associated 
with  X^  to  be 

ei  » e(x^Axi,x,i,Bxi)  , 

and  throughout  the  rest  of  this  section  we  shall  assune  that  8^  5 6^  5 ... 
i 6n-  If  B is  positive  definite,  this  corresponds  to  the  natural 
ordering  s ...  s X^  of  the  eigenvalues.  However,  if  B is  not 

positive  definite,  this  need  not  be.  For  example,  if  C lies  in  the  right 
half  plane  then  the  positive  eigenvalues  will  precede  the  negative  ones. 

The  8^  have  a characterization  that  is  completely  analogous  to  the 
min-max  characterization  of  the  eigenvalues  of  the  Hermitian  problem. 

Theorem  3.1.  Let  the  problem  (1.1)  be  definite  and  let  its  eigen- 
values be  ordered  to  that  8.  5 8,  5 ...  s 8.  Then 

12  n 


(3.1) 


T T 

min  max  8(x  Ax,x  Bx) 
dim(X)=i  x€X 
x?0 
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(3.2)  0.  - max  min  0(x1Ax,x1Bx)  . 

1 dim(X)«*n-i*l  x€X 
xj*0 

Proof.  As  in  Section  2,  choose  <p  so  that  is  positive  definite. 
Let  the  ray  defining  0 be  rotated  counterclockwise  through  <p  and  use 

this  new  ray  to  define  a new  function  0 . Then 

' <p 


0^  (x^x.x^x)  * 0 (x^Ax.x^Bx)  , 


and  without  loss  of  generality  we  may  drop  the  subscripts  <p  and  assume 
that  B is  positive  definite. 

For  B positive  definite  we  have  [7] 


X. 

l 


H. 

X Ax 

min  max  -n — 
dim(X)=i  x€X  xnBx 
x>«0 


But  because  V is  in  the  upper  half  plane,  the  mapping  X -*■  0(X,1)  is 
increasing . Hence 


0.  = 0(X.,1)  = min  max  0 
1 1 dim(X)«i 

x^O 


/x^Ax 

dim(X)*i  x(X  \ x^Bx 


, 1 


H H 

= min  max  0(x  Ax,x  Bx)  . 
dim(X)=i  x€X 
x/0 

A proof  of  (3.2)  follows  from  the  characterization 

xHAx 

X.  = min  min  -n — o 

1 dim(X)=n-i+l  x€X  xlx 
x/0 


As  a consequence  of  Theorem  3.1,  a number  of  separation  theorems  for 
the  eigenvalues  of  symnetric  matrices  generalize  to  the  definite  problem 
(1.1).  For  exanple,  if  A and  B are  principal  submatrices  of  A and 

B of  order  n-1,  then  the  eigenvalues  of  the  problem 

Ax  = XBx  satisfy 


e < 6.  i0,50,i...£6  , 50  . 

112  2 n-1  n 


However,  our  main  interest  in  the  theorem  is  that  it  can  be  used  to  prove 
the  following  perturbation  theorem. 

Theorem  3.2.  Let  (A,B)  be  definite  and  let  the  eigenvalues  of 


Ax  = XBx  be  ordered  so  that  0j  5 0,  i . 

e = {||E||2+||F||2}1/2 


£ 6 . Let 
n 


and  assume  that  e < c(A,B)  so  that  (A,B)  = (A+E,B+F)  is  definite.  Let  the 
eigenvalues  of  Ax  = Xx  be  ordered  so  that  6^  - ®2  “ •••  - ®n*  ^en 


(3.3) 


Then 


lv5il  5 s“'1  ' 

Proof.  Let  X^  be  a minimizing  subspace  in  the  equality  (3.1), 


(3.4) 


0.  5 max  e[xT(A+E)x,  xT(B+F)x]  . 
1 x€Xt 

x^O 


Let  x ( X.  be  a vector  of  norm  unity  for  which  the  maximum  in  (3.4)  is 
attained.  Then  since  9(x^Ax,x^Bx)  £ 6^,  the  point  (x^(A+E)x,x^(A+F)x) 


pp 
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'1 


must  lie  in  the  circle  of  radius  e sketched  below. 


The  maximum  increase 
circle  is  situated  as 


shown  below. 


Elementary  geometry  now  gives  r|  ■ sin  * [&/c(A,B) ] , which  shows  that 

®i  ~ 6i  + r|‘  T^ie  result  ei  " 'H  - ei  follows  from  a similar  argument  applied 

to  the  characterization  (3.2)  of  0i.o 

There  are  some  observations  to  be  made  about  the  theorem.  First, 
the  bound  (3.3)  inmediately  implies  a bound  in  the  chordal  metric.  Speci- 
fically, let  X = n/v  and  X = p./v.  Then  the  chordal  distance  between 

X and  X is 


A 
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X(X,X)  = — Ip  - P 


42+v2 


But  it  is  easily  seen  that 


X(X,X)  = sin  |e(u,v)-©(ii,v)  | . 


Then  it  follows  from  (3.3)  that 


x(xi,xi) 


c 


T5OT  • 


Note  that  this  inequality  is  somewhat  weaker  than  (3.3), since  eigenvalues 
that  have  angles  differing  by  amounts  near  tr  will  have  a chordal  differ- 
ence near  zero. 

The  second  observation  that  the  theorem  implies  the  classical  bounds 
for  the  Hermitian  eigenvalue  problem.  Specifically  let  B = I,  and  let 
X^(-t)  = X^/t  be  the  eigenvalues  of  the  problem 

(3.5)  Ax  = X^ (t) (tI)x  . 


Let  c(t)  = c(A,tI)  be  the  Crawford  number  for  the  problem  (3.5).  Note  that 
when  t is  large,  c(t)  = r + 0(1).  If  we  consider  a Hermitian  perturbation 
in  A of  norm  e,  then  we  have  the  bounds 

(3.6)  iei(^-®i('r)i  - sin  1 = - + 

Since  X^(t)  and  X (t)  approach  0 as  1/t,  we  have 
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(3.7)  |ei(T)-ei(x)  | = |\i(x)-\i(x)| 

Combining  (3.6)  and  (3.7)  and  multiplying  by 

IV V 5 ‘ * °(?)- 

which  gives  the  classical  result  when  x -+  ». 

Finally  we  note  that  it  follows  from  the  results  in  [12]  that 
for  a simple  eigenvalue  X^  and  e sufficiently  small, 

X(X,  ,».,)<  5 * 0(t2)  . 

1 1 /(xHax/.Cx^)2 

2 H 2 H 2 

Since  c (A,B)  s (xVAxp  + (x-Bx^)  , it  is  seen  that  we  pay  a price  in  the 
sharpness  of  our  bounds  to  gain  freedom  from  considerations  of  multiplicity. 

4.  Perturbation  bounds:  eigenspaces 

In  this  section  we  shall  derive  perturbation  bounds  for  the  eigen- 
vectors of  the  definite  problem  (1.1).  These  bounds  imply  that  eigenvectors 
corresponding  to  poorly  separated  eigenvalues  are  very  sensitive  to  pertur- 
bations in  A and  B;  however,  the  subspace  spanned  by  the  eigenvectors 
corresponding  to  a cluster  of  eigenvalues  may  be  relative  insensitive.  We 
shall,  therefore,  phrase  our  bounds  in  terms  of  subspaces  rather  than  indi- 
vidual eigenvectors.  This  approach  is  analogous  to  the  one  taken  in  the 
ordinary  eigenvalue  problem,  where  one  bounds  perturbations  in  invariant 
subspaces  corresponding  to  clustered  eigenvalues  [4,8,9,11]. 


*°(?)' 
x,  we  get 
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We  begin  by  noting  that  if  x is  an  eigenvector  of  (1.1),  then 
Ax  and  Bx  are  dependent.  Following  [10],  we  can  generalize  this 
idea  to  subspaces. 

Definition  4.1.  A subspace  X is  an  eigenspace  of  (1.1)  if 
dim(AX+BX)  < dim  (X)  . 

Clearly  any  set  of  eigenvectors  of  (1.1)  spans  an  eigenspace.  Con- 
versely, if  the  problem  is  definite,  then  an  eigenspace  is  spanned  by  a 
set  of  eigenvectors.  To  see  this,  we  first  note  that  the  space  A^X  + 

B X,  where  A and  B are  defined  as  in  Section  2,  is  the  same  as 

<P  <P  <P 

AX  + BX.  Hence  we  may  assume  that  B is  positive  definite.  Now  let  X^ 

be  an  eigenspace  of  dimension  t spanned  by  the  columns  of  the  full  rank 

matrix  X^.  Then  dimfBXp  = £,  and  BXj  has  an  orthogonal  complement 

X2  of  dimension  n-£.  Since  B is  positive  definite  X^  @ X2  3 C0. 

T 

Let  the  columns  of  the  n * (n-£)  matrix  X2  span  X2>  so  that  X2BX^  = 0. 
But  from  the  definition  of  eigenspace  and  the  fact  that  dim(BX.)  = l, 
we  have  that  AX  c BX.  Hence  X^AXj  =0.  It  follows  that 
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simultaneously  diagonalized  by  nonsingular  matrices  Y^.  Then 
(XjYj,X2Y2)  is  nonsingular  and  diagonalizes  A and  B.  In  particular 
the  columns  of  X^Y^  are  eigenvectors  spanning  Xj. 

Turning  now  to  the  perturbation  theorem,  we  consider  an  eigen space 
X^  of  dimension  t of  the  definite  problem  (1.1).  Let  X2  be  its 
complementary  eigenspace.  Let  the  columns  of  X^  = (x^x,, . . . and 

X2  = (x£+1 xn)  be  eigenvectors  spanning  X^  and  X2,  chosen  so  that 

NL  and  NL  (i=l,2)  in  (4.1)  and  (4.2)  are  diagonal.  Set 

= diag^, . . . .u^)  Nj  * diag^ v^) 

M2  “ diag(p,^+^, . . . ,^)  N2  = diag(v^+^, . . . ,vn)  . 

We  shall  use  the  same  notation  for  the  perturbed  eigenvalue  problem 
Ax  = XBx,  except  that  all  quantities  will  be  overlined  with  tildas.  We 
shall  assume  that  both  problems  are  definite. 

We  begin  with  a lemma  which  furnishes  a possible  basis  for  the 
perturbed  eigenspace.  We  will  obtain  sharper  results  if  we  recognize 
that  certain  infima  that  are  bounded  below  by  Crawford  numbers  are  actually 
taken  over  eigenspaces.  Accordingly,  we  define 

c(A,B;X)  = inf  (|xH(A+iB)x|  : x € X,||x||=l)  . 

Lemma  4.2.  For  i = i,2,...,n  let  X^  = and  X^  = and 

let 


6 = min  (X(X^,Xj) 


i-i, ...,«;  j=«+i 


»n) 


If  6 > 0,  then  there  is  a matrix  Q whose  columns  satisfy 
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iidiii  , 

w- 


such  that 


R(Xj  + Q)  c X2  , 

where  R(X)  denotes  the  column  space  of  X. 

Proof.  Let  §2  denote  operator  defined  by  restricting  l2  t0  the 
space  X-,.  By  rotating  the  problem,  we  may  assume  that  §2  is  positive 
definite  and 

IIB^II'1  = c(A,B;X2)  . 

Note  that  under  this  assumption  N2  is  positive  definite. 

We  shall  seek  Q in  the  form 

Q = X2P  . 

From  the  definition  of  eigenspace  R(Xj+Q)  c X^  if  and  only  if 
RfBCx^+Q) ] ±X2.  This  is  equivalent  to  requiring  that  XpHXj+^P)  = 0 
or 

(4.3)  X^  = -N2P  . 

rvTrs, 

To  develop  an  expression  for  X2BXJ,  note  that  since 

(A-E)X1N1  = (B-FJX^  , 


we  have 
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Xp&iNi  - X^BXj  - X^KXNj-FXMj)  « X^R 


But  N2Xp\  = M$B,  so  that 


^X^BXjNj  - N2X2BX1M1  = N2X2R 


Hence,  because  M2  and  N2  commute,  if  we  choose  P to  satisfy 

J^PNj  - NjPMj  = -X2R  , 
then  N2P  will  satisfy  (4.3). 

Let  t * ■ = {i.v • - v-u  - Then  the  hypothesis  6 > 0 implies  that 
J ^ 1 ^ J ^ 

Tji  ^ 0 (i*l»  • • • ,*•'»  j=£+l,  • • • »n) . Consequently  P is  uniquely  defined 

and  its  (j-K.,i)  element  is  given  by 

~T 

x:i\ 

Pj-£,i  t.  . ’ 

J • ij 

where  denotes  the  i-th  column  of  R.  It  follows  that  the  i-th  column 
of  Q = X2P  is  given  by 


■■  ■ U 


1 r.  = S.r.  , 

+1  Tij  / 1 11 


and  our  problem  is  reduced  to  determining  bounds  on  I|SJ|. 

~ „ ~-l/2 

Let  Y2  * (yj,+i»y£+2 yn-*  = X2N2  ’ 111611 


11  V • rp 

Si  = ^ t- 

1 je£+l  lj  J J 
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But  if  By  is  the  operator  defined  at  the  beginning  of  the  proof 

}2«2*2  * S21/2*2*2VI21/2  * 1 - 

so  that  the  columns  of  6^^Y2  are  orthonormal.  Hence  the  eigenvalues 
of  B^S-  B^2  are  Hence 

1 ~2  j ij 


C4.4)  llsjl  =■  Il|21/2tl2/2si62/2^21/211  - l6'2lll^2/2si^2/211 

maxj{|vj/Tij|}  max.{|y^?>v?/Ti.|} 


where  &2  = c (A,B ;X2) 
Now  from  (4.5) 


(4.5) 


qi  = Siri  = Si  (Exivi+Fx^  p.i)  . 


Hence 


(4.6) 


llqjll  2 ||Si||||x.||e^T  . 


Combining  (4.5)  and  (4.6)  we  see  that 


*i"  e 


^i^i 


- j i ~ ~ , ~ ~ c 

i c2  c26 


The  bound  in  the  theorem  may  be  written  in  the  form 


IIQ"f  t 
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2 2 

where  ||X||p  s Z|x^|  denotes  the  Frobenius  norm  of  X.  Thus  the  bound 
depends  directly  on  the  perturbation  and  inversely  on  the  gap  between 
the  eigenvalues,  with  the  Crawford  number  determining  the  size  of  the 
effect.  Unfortunately  R(X.-K}^)  need  not  be  an  eigenspace  unless 
dim  [RCXj+Q^]  ■ SL.  When  £ = 1,  so  that  Xj  is  the  single  eigenvector 
Xp  we  can  assure  this  by  requiring  that  Hq^H  < ||x^||. 

Theorem  4.3.  Let  6 = min  {X(XpXj)  : j=2,3, . . . ,n) . If  e/6  < 
c(A,B;X2), then  there  is  a vector  q^  satisfying 

*l'  . _6 . 

6c(A,B;X2) 

IV  <v«*  <*> 

such  that  x^  + q is  an  eigenvector  of  Ax  = XBx  corresponding  to  X^. 

When  !i  < 1,  we  must  take  into  account  the  effect  of  near  dependencies 
among  the  columns  of  X.  Define 

inf(X, ) = inf  ||X.x||  . 

1 l|x||=l  1 

Then  if  ||Q||  < inf(Xp),  rank(X+Q)  = rank(X)  = Z.  These  considerations  lead 
to  the  following  theorem. 

Theorem  4.4.  Let 


If  Tj  < X then  R(X1+Q)  = ~Xy 

Proof.  It  follows  from  the  proof  of  Theorem  4.3  that  if  x^  is 
scaled  by  a factor  a,  then  q^  is  scaled  by  the  same  factor.  Hence  we 
may  assume  that  |Jxj  |J  = 1.  We  first  find  a lower  bound  on  inf(Xj).  By 
rotating  (A,B),  we  may  assume  that  if  denotes  the  restriction  of  B 

to  X^  then  B-^  is  positive  definite  and  inf (B^)  = c(A,B;Xj).  Of  course, 


(4.7) 
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II B,  ||  S HBII  5 /lAH^IlBH2  . 


Now 


xTb.X,  = N., 

1~1  1 1 

1/2  -1/2 

from  which  it  follows  that  the  colunns  of  U = B.  X.N.  are  orthonormal. 

~1  11 

Hence 


1 = inf  (U)  < ||Bj/2||||Nj1/2||inf(X1)  . 


Since  the  columns  of  Xj  have  nonm  unity,  we  have  > c(A,B;Xj).  Hence 
from  (4.7) 


(4.8) 


But 


inf  (x)  > 


c(A,B;X1) 

4aii2+iibh2 


(4.9) 


and  the  result  follows  from  (4.9)  and  (4.10).o 

When  £ = 1,  Theorem  4.5  does  not  reduce  to  Theorem  4.3  because  we 
have  given  too  much  away  in  the  bound  (4.9).  The  principal  application 
is  to  the  case  i > 1,  where  the  condition  n < 1 not  only  guarantees  that 
X = R(Xj+Q)  but  also  implies  that  and  Xj  are  acutely  situated  with 
respect  to  one  another.  In  fact  a slight  modification  of  the  proof  of 
Theorem  4.1  in  [13]  gives  the  following  corollary. 
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Corollary  4.5.  Let  P^  and  P-^  denote  the  orthogonal  projections 


onto  Xj  and  X^.  If  n < 1,  then 


»VpVf*  ft  ■ 


It  is  unfortunate  that  q contains  the  factor  /£,  since  it  grows 
with  dimension  of  X^.  The  presence  of  this  factor  is  a direct  consequence 
of  the  fact  that  Lemma  4.2  bounds  IIQII p/ l|X||p  instead  of  ||Q||/||F||.  For  the 
Hermitian  eigenvalue  problem,  Davis  and  Kahan  [4]  have  been  able  to  obtain 
bounds  in  the  spectral  norm  imposing  additional  restrictions  on  the  loca- 
tion of  the  eigenvalues.  Whether  such  bounds  can  be  obtained  for  the 
definite  generalized  eigenvalue  problem  is  an  open  question. 
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